Bounding the cardinality of aggregate views through domain-derived constraints

نویسندگان

  • Paolo Ciaccia
  • Matteo Golfarelli
  • Stefano Rizzi
چکیده

Accurately estimating the cardinality of aggregate views is crucial for logical and physical design of data warehouses. This paper proposes an approach based on cardinality constraints, derived a-priori from the application domain, which may bound either the cardinality of a view or the ratio between the cardinalities of two views. We face the problem by first computing satisfactory bounds for the cardinality, then by capitalizing on these bounds to determine a good probabilistic estimate for it. In particular, we propose a bounding strategy which achieves an effective trade-off between the tightness of the bounds produced and the computational complexity. 2002 Elsevier Science B.V. All rights reserved.

منابع مشابه

On Estimating the Cardinality of Aggregate Views

Accurately estimating the cardinality of aggregate views is crucial for logical and physical design of data warehouses. While the warehouse is under development and data are not available yet, the approaches based on accessing data cannot be adopted. This paper proposes an approach to estimate the cardinality of views based on a-priori information derived from the application domain. We face th...

متن کامل

Using Domain-Derived Constraints

Accurately estimating the cardinality of aggregate views is crucial for logical and physical design of data warehouses. While the warehouse is under development and data are not available yet, the approaches based on accessing data cannot be adopted. This paper reports on the progress of an ongoing research aimed at devising a comprehensive approach to estimate the cardinality of views based on...

متن کامل

Techniques for logical design and ef fi cient querying of data warehouses

Sommario Logical design of data warehouses (DW) encompasses the sequence of steps which, given a core work-load, determine the logical scheme for the DW. A key step in logical design is view materialization. In this paper we propose an original approach to materialization in which the workload is characterized by the presence of complex queries represented by Nested Generalized Projection/Selec...

متن کامل

On the Expected Probability of Constraint Violation in Sampled Convex Programs

In this note, we derive an exact expression for the expected probability V of constraint violation in a sampled convex program (see Calafiore and Campi in Math. Program. 102(1):25–46, 2005; IEEE Trans. Autom. Control 51(5):742–753, 2006 for definitions and an introduction to this topic): V = expected number of support constraints 1 + number of constraints . This result (Theorem 2.1) is obtained...

متن کامل

Propagation of ESCL Cardinality Constraints with Respect to CEP Queries

Semantic optimization of database queries, i.e. the use of metadata (constraints) for query optimization, is well investigated and has led to signi cant performance gains. The algorithm semantically rewriting database queries presented in [2] is applicable to CEP queries with some adaptions. One of these adaptions concerns the treatment of views and the constraints for them. To reduce the numbe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل
عنوان ژورنال:
  • Data Knowl. Eng.

دوره 45  شماره 

صفحات  -

تاریخ انتشار 2003